智能论文笔记

Quote Erat Demonstrandum: A Web Interface for Exploring the Quotebank Corpus

Vuk Vuković , Akhil Arora , Huan-Cheng Chang , Andreas Spitz , Robert West

分类：自然语言处理

2022-07-07

归因引号的使用是新闻中信息传播的最直接，最少过滤的途径。因此，引用在新闻报道的概念，接收和分析中起着核心作用。由于报价比常规报告提供了更直接的窗口，因此对于记者和研究人员来说，它们是宝贵的资源。尽管大量的研究工作已致力于自动提取新闻的报价及其归因于演讲者的方法，但很少有当代来源的全面归因报价可供公众提供。在这里，我们提出了一个自适应网络界面，用于搜索QuoteBank，这是新闻中的大量报价集合，我们可以在https://quotebank.dlab.tools上提供。

translated by 谷歌翻译

Strong Heuristics for Named Entity Linking

Marko Čuljak , Andreas Spitz , Robert West , Akhil Arora

分类：自然语言处理 | 机器学习

2022-07-06

由于看不见和新兴实体的频率，新闻中的命名实体链接（NEL）是一项具有挑战性的努力，因此需要使用无监督或零摄像的方法。但是，这种方法往往会带来警告，例如不整合新兴实体的合适知识库（例如Wikidata），缺乏可扩展性和不良的可解释性。在这里，我们考虑在Quotebank中的人歧义，这是新闻中大量的说话者归类的语言，并调查了NEL在网络规模的语料库中直观，轻巧且可扩展的启发式方法的适用性。我们表现最好的启发式歧义分别在Quotebank和Aida-Conll基准上分别占94％和63％。此外，提出的启发式方法与最先进的无监督和零摄像方法，本本系和MGenRE相比，从而成为无监督和零照片实体链接的强基础。

translated by 谷歌翻译

Efficient Entity Candidate Generation for Low-Resource Languages

Alberto García-Durán , Akhil Arora , Robert West

分类：自然语言处理 | 人工智能

2022-06-30

候选生成是实体链接中的重要模块。它在多个NLP任务中也起着关键作用，这些任务已被证明是有益地利用知识库的。然而，随着幼稚的方法获得很好的表现，它经常在单语的英语实体中被忽略。不幸的是，现有的英语方法不能成功地转移到资源不足的语言中。本文构成了对候选人生成问题的深入分析，即跨语性实体与关注低资源语言的关注。除其他贡献外，我们指出了先前工作中进行的评估的局限性。我们根据其难度将查询的特征介绍给类型，这提高了不同方法的性能的解释性。我们还提出了一个基于索引的构建，其设计是由基于更复杂的转移学习方法的动机，提出了一种轻巧而简单的解决方案。对2个评估设置下的9个现实世界数据集进行了彻底的经验分析表明，我们的简单解决方案在几乎所有数据集和查询类型的质量和效率方面都优于最先进的方法。

translated by 谷歌翻译

A systems design approach for the co-design of a humanoid robot arm

Akhil Sathuluri , Anand Vazhapilli Sureshbabu , Markus Zimmermann

分类：机器人

2022-12-29

Classically, the development of humanoid robots has been sequential and iterative. Such bottom-up design procedures rely heavily on intuition and are often biased by the designer's experience. Exploiting the non-linear coupled design space of robots is non-trivial and requires a systematic procedure for exploration. We adopt the top-down design strategy, the V-model, used in automotive and aerospace industries. Our co-design approach identifies non-intuitive designs from within the design space and obtains the maximum permissible range of the design variables as a solution space, to physically realise the obtained design. We show that by constructing the solution space, one can (1) decompose higher-level requirements onto sub-system-level requirements with tolerance, alleviating the "chicken-or-egg" problem during the design process, (2) decouple the robot's morphology from its controller, enabling greater design flexibility, (3) obtain independent sub-system level requirements, reducing the development time by parallelising the development process.

translated by 谷歌翻译

SLUE Phase-2: A Benchmark Suite of Diverse Spoken Language Understanding Tasks

Suwon Shon , Siddhant Arora , Chyi-Jiunn Lin , Ankita Pasad , Felix Wu , Roshan Sharma , Wei-Lun Wu , Hung-Yi Lee , Karen Livescu , Shinji Watanabe

分类：自然语言处理

2022-12-20

Spoken language understanding (SLU) tasks have been studied for many decades in the speech research community, but have not received as much attention as lower-level tasks like speech and speaker recognition. In particular, there are not nearly as many SLU task benchmarks, and many of the existing ones use data that is not freely available to all researchers. Recent work has begun to introduce such benchmark datasets for several tasks. In this work, we introduce several new annotated SLU benchmark tasks based on freely available speech data, which complement existing benchmarks and address gaps in the SLU evaluation landscape. We contribute four tasks: question answering and summarization involve inference over longer speech sequences; named entity localization addresses the speech-specific task of locating the targeted content in the signal; dialog act classification identifies the function of a given speech utterance. We follow the blueprint of the Spoken Language Understanding Evaluation (SLUE) benchmark suite. In order to facilitate the development of SLU models that leverage the success of pre-trained speech representations, we will be publishing for each task (i) annotations for a relatively small fine-tuning set, (ii) annotated development and test sets, and (iii) baseline models for easy reproducibility and comparisons. In this work, we present the details of data collection and annotation and the performance of the baseline models. We also perform sensitivity analysis of pipeline models' performance (speech recognizer + text model) to the speech recognition accuracy, using more than 20 state-of-the-art speech recognition models.

translated by 谷歌翻译

Chaotic Variational Auto Encoder based One Class Classifier for Insurance Fraud Detection

K. S. N. V. K. Gangadhar , B. Akhil Kumar , Yelleti Vivek , Vadlamani Ravi

分类：机器学习

2022-12-15

Of late, insurance fraud detection has assumed immense significance owing to the huge financial & reputational losses fraud entails and the phenomenal success of the fraud detection techniques. Insurance is majorly divided into two categories: (i) Life and (ii) Non-life. Non-life insurance in turn includes health insurance and auto insurance among other things. In either of the categories, the fraud detection techniques should be designed in such a way that they capture as many fraudulent transactions as possible. Owing to the rarity of fraudulent transactions, in this paper, we propose a chaotic variational autoencoder (C-VAE to perform one-class classification (OCC) on genuine transactions. Here, we employed the logistic chaotic map to generate random noise in the latent space. The effectiveness of C-VAE is demonstrated on the health insurance fraud and auto insurance datasets. We considered vanilla Variational Auto Encoder (VAE) as the baseline. It is observed that C-VAE outperformed VAE in both datasets. C-VAE achieved a classification rate of 77.9% and 87.25% in health and automobile insurance datasets respectively. Further, the t-test conducted at 1% level of significance and 18 degrees of freedom infers that C-VAE is statistically significant than the VAE.

translated by 谷歌翻译

Structured Like a Language Model: Analysing AI as an Automated Subject

Liam Magee , Vanicka Arora , Luke Munn

分类：人工智能

2022-12-08

Drawing from the resources of psychoanalysis and critical media studies, in this paper we develop an analysis of Large Language Models (LLMs) as automated subjects. We argue the intentional fictional projection of subjectivity onto LLMs can yield an alternate frame through which AI behaviour, including its productions of bias and harm, can be analysed. First, we introduce language models, discuss their significance and risks, and outline our case for interpreting model design and outputs with support from psychoanalytic concepts. We trace a brief history of language models, culminating with the releases, in 2022, of systems that realise state-of-the-art natural language processing performance. We engage with one such system, OpenAI's InstructGPT, as a case study, detailing the layers of its construction and conducting exploratory and semi-structured interviews with chatbots. These interviews probe the model's moral imperatives to be helpful, truthful and harmless by design. The model acts, we argue, as the condensation of often competing social desires, articulated through the internet and harvested into training data, which must then be regulated and repressed. This foundational structure can however be redirected via prompting, so that the model comes to identify with, and transfer, its commitments to the immediate human subject before it. In turn, these automated productions of language can lead to the human subject projecting agency upon the model, effecting occasionally further forms of countertransference. We conclude that critical media methods and psychoanalytic theory together offer a productive frame for grasping the powerful new capacities of AI-driven language systems.

translated by 谷歌翻译

Spatio-Temporal Super-Resolution of Dynamical Systems using Physics-Informed Deep-Learning

Rajat Arora , Ankit Shrivastava

分类：机器学习

2022-12-08

This work presents a physics-informed deep learning-based super-resolution framework to enhance the spatio-temporal resolution of the solution of time-dependent partial differential equations (PDE). Prior works on deep learning-based super-resolution models have shown promise in accelerating engineering design by reducing the computational expense of traditional numerical schemes. However, these models heavily rely on the availability of high-resolution (HR) labeled data needed during training. In this work, we propose a physics-informed deep learning-based framework to enhance the spatial and temporal resolution of coarse-scale (both in space and time) PDE solutions without requiring any HR data. The framework consists of two trainable modules independently super-resolving the PDE solution, first in spatial and then in temporal direction. The physics based losses are implemented in a novel way to ensure tight coupling between the spatio-temporally refined outputs at different times and improve framework accuracy. We analyze the capability of the developed framework by investigating its performance on an elastodynamics problem. It is observed that the proposed framework can successfully super-resolve (both in space and time) the low-resolution PDE solutions while satisfying physics-based constraints and yielding high accuracy. Furthermore, the analysis and obtained speed-up show that the proposed framework is well-suited for integration with traditional numerical methods to reduce computational complexity during engineering design.

translated by 谷歌翻译

A Search and Detection Autonomous Drone System: from Design to Implementation

Mohammadjavad Khosravi , Rushiv Arora , Saeede Enayati , Hossein Pishro-Nik

分类：机器人 | 计算机视觉 | 机器学习

2022-11-29

Utilizing autonomous drones or unmanned aerial vehicles (UAVs) has shown great advantages over preceding methods in support of urgent scenarios such as search and rescue (SAR) and wildfire detection. In these operations, search efficiency in terms of the amount of time spent to find the target is crucial since with the passing of time the survivability of the missing person decreases or wildfire management becomes more difficult with disastrous consequences. In this work, it is considered a scenario where a drone is intended to search and detect a missing person (e.g., a hiker or a mountaineer) or a potential fire spot in a given area. In order to obtain the shortest path to the target, a general framework is provided to model the problem of target detection when the target's location is probabilistically known. To this end, two algorithms are proposed: Path planning and target detection. The path planning algorithm is based on Bayesian inference and the target detection is accomplished by means of a residual neural network (ResNet) trained on the image dataset captured by the drone as well as existing pictures and datasets on the web. Through simulation and experiment, the proposed path planning algorithm is compared with two benchmark algorithms. It is shown that the proposed algorithm significantly decreases the average time of the mission.

translated by 谷歌翻译

A Study on the Integration of Pre-trained SSL, ASR, LM and SLU Models for Spoken Language Understanding

Yifan Peng , Siddhant Arora , Yosuke Higuchi , Yushi Ueda , Sujay Kumar , Karthik Ganesan , Siddharth Dalmia , Xuankai Chang , Shinji Watanabe

分类：自然语言处理

2022-11-10

Collecting sufficient labeled data for spoken language understanding (SLU) is expensive and time-consuming. Recent studies achieved promising results by using pre-trained models in low-resource scenarios. Inspired by this, we aim to ask: which (if any) pre-training strategies can improve performance across SLU benchmarks? To answer this question, we employ four types of pre-trained models and their combinations for SLU. We leverage self-supervised speech and language models (LM) pre-trained on large quantities of unpaired data to extract strong speech and text representations. We also explore using supervised models pre-trained on larger external automatic speech recognition (ASR) or SLU corpora. We conduct extensive experiments on the SLU Evaluation (SLUE) benchmark and observe self-supervised pre-trained models to be more powerful, with pre-trained LM and speech models being most beneficial for the Sentiment Analysis and Named Entity Recognition task, respectively.

translated by 谷歌翻译